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ABSTRACT 

Aims. Using the VIMOS Public Extragalactic Redshift Survey (VIPERS) we aim to jointly estimate the key parameters that describe 
the galaxy density field and its spatial correlations in redshift space. 

Methods. We use the Bayesian formalism to jointly reconstruct the redshift-space galaxy density field, power spectrum, galaxy bias 
and galaxy luminosity function given the observations and survey selection function. The high-dimensional posterior distribution is 
explored using the Wiener filter within a Gibbs sampler. We validate the analysis using simulated catalogues and apply it to VIPERS 
data taking into consideration the inhomogeneous selection function. 

Results. We present joint constraints on the anisotropic power spectrum as well as the bias and number density of red and blue galaxy 
classes in luminosity and redshift bins as well as the measurement covariances of these quantities. We find that the inferred galaxy bias 
and number density parameters are strongly correlated although these are only weakly correlated with the galaxy power spectrum. 

The power spectrum and redshift-space distortion parameters are in agreement with previous VIPERS results with the value of the 
growth rate /erg = 0.38 with 18% uncertainty at redshift 0.7. 

Key words, large-scale structure of universe, cosmology: observations,galaxies:statistics,cosmology:cosmological parame- 
ters,methods: statistical,methods:data analysis 


1. Introduction 


The distribution of galaxies on large scales provides a funda¬ 
mental test of the cosmological model. In the standard picture, 
galaxies trace an underlying matter density field and the statisti¬ 
cal properties of this field such as its powe r spectrum and higher 
order moments are given by the theory ( [Peebles [ |1980| ). This 
clear view is confounded, however, by the sparse distributions 
of luminous galaxies mapped by surveys ( jLahav & Suto||20()4| ). 
Galaxies are complex systems; they are biased tracers of the non¬ 
linear matter field and their clustering strength depends on their 
properties and formation histories (jPaVis & Geller |1976[ Blan-j 


ton et al.||2005t|Kai^|1984[|Bardeen et al.[|1986t|Mo & White| 


^ 1996| ). Furthermore, their redshift gives a distorted view of dis¬ 

tance, affected by coherent and random velocities ( |Kaiser|p"987| ). 

Upcoming galaxy surveys will be sensitive to subtleties in 
these trends requiring increasingly sophisticated modelling and 
numerical simulations to interpret the galaxy distribution in de¬ 


tail ([ The Dark Energy Surv ey C ollaboration ||2005[|LSST Dark 


Energy Sci ence Collaboration[|2012(]Levi etaI.||26l3[|Laureijs 
et al.p2011j ). At the same time, these surveys will be sufficiently 


large to be limited by minute selection effects that systematically 
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and significantly alter the observed distribution of galaxies. In¬ 
strumental and observational artefacts can masquerade as gen¬ 
uine astrophysical effects and vice-versa. Thus the analyses will 
need to track a large number of instrumental and astrophysical 
parameters and be able to characterise the covariances between 
them. Reliable error estimation will require incorporating the set 
of both systematic and random uncertainties. The stakes are high 
as experiments promise highly precise constraints on the nature 
of gravity, dark energy and dark matter ( jAmendola et al.||2013| ). 


Together, the physical and instrumental models compose the 
total data model. Given the large number of parameters, the 
Bayesian approach is often preferred over the frequentist one to 
set joint constraints on the relevant physical quantities ( jTrotta 


|2008| ). At the heart of this approach is Bayes theorem which 
dictates a recipe for translating a set of observations into con¬ 
straints on model parameters. Of fundamental importance is the 
incorporation of any prior knowledge of these parameters. This 
framework provides a natural means to jointly constrain physical 
parameters of interest while marginalising over a set of nuisance 
parameters. A paradigmatic e xample is the analysi s of cosmic 
microwave background data ( jWandelt et al.j |2004|) as demon- 
strated through the ESA Planck mission results ( [Planck Collab- 
joration et al.|[2015[ ). 
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Fig. 1: The VIPERS galaxy number density of the v5 internal 
release sample. The curves show the effect of the completeness 
corrections including the spectroscopic success rate (SSR) and 
target sampling rate (TSR). Our analysis uses the redshift range 
0.6-1.0 (vertical lines). 


The Wiener filter is the first example of the application 
of Bayesian reconstruction techniques to galaxy surveys. The 
Wiener solution corresponds to the maximum a posteriori so¬ 
lution given a Gaussian likelihood and prior. In general, for a 
signal contaminated by noise, the Wiener filter gives a recon¬ 
struction of the true signal with the minimum residual variance 
( [Rybicki & Press[[l992| ). This is true also for non-Gaussian sig¬ 
nal and noise sources, and for this reason, since the galaxy field 
is not Gaussian (it is thought to tend toward Gaussianity on very 
large scales), the Wiener filter has seen significant use in recon- 
struct ing the density field from galaxy surveys {e.g. |Lahav et aT) 
|1994| ) and in particular to predict large-scale structures behind 
the Galactic plane ( [Zaroubi et aH|1995] ). 

Wiener filtering is comparable to other adaptive density re- 
construction techniques su ch as Delaunay tessellations ( |Schaap| 
|& van de Weygaert |2000| ), although Wiener filtering offers the 
advantage of naturally accounting for a complex survey selec¬ 
tion function with inhomogeneous sampling. Examples of appli¬ 
cations of Wiener filtering to galaxy surveys include the Two- 
degree Field Galaxy Redshift Survey (2dFGRS) in which the 
Wiener filter was use d to identify galaxy c lusters and voids 
( [Erdogdu et all |2004| ). [Kitaura et al.| ( |2009| ) present a Wiener 
density field reconstruction of the Sloan Digital Sky Survey 
(SDSS) main sample. Applied to the VIMOS Extragalatic Red¬ 
shift Survey (VIPERS; Guzzo et al. 2014), the Wiener filter can 
naturally account for inhomogeneous sampling and survey gaps. 
In a comparison study of different density field estimators for 
VIPERS [Cucciati et al.| ( |2Q14| ) find that the Wiener filter per¬ 
forms well although it over-smooths the field in low density en¬ 
vironments affecting cell-count statistics. 

More physically motivated probability distribution functions 
have been developed to improve over the Wiener filter and ob¬ 
tain unbiased density field reconstructions. [Krtaura et alT] ( |2010| ) 


|et al.[[2004t|Granett et al.[ 2012| ). Also for the galaxy luminos¬ 
ity function estimates, maximum likelihood techniques have en- 
joyed significant use ([Ilbert et al.| |2005[ [Blanton et al.| [2003 [ 
[Efstathiou et al.[p~988j). 

Gaussian likelihood methods have only recently been devel¬ 
oped to jointly infer the density field, power spectrum and lu- 
minosity function f rom galaxy surveys ( [Kitaura & EnBlin|[2008t 
[EnBlin et ar|[2009[ ). The first application to the Sloa n Digital Sky 
Survey was demonstrated by [Jasche et al.[ ( |2010b[ ) who utilise a 
Gaussian likelihood and prior to jointly estimate the underly¬ 
ing galaxy field and power spectrum. This work was generalised 
to simultane ously estimate the l i near ga l axy bias and lum inos- 
ity function ( [Jasche & Wandelt[ [2013b[ ). [Ata et al.] ( [2015[ ) fur¬ 
ther model a scale-dependent and stochastic galaxy bias using 
the Log-normal Poisson model. The method ology has also been 
developed for photometric redshift surveys ([Jasche & Wandelt 

[MT^ . 

The full description of the galaxy field requires considera¬ 
tion of the higher order moments and depends on the physics of 
structure formation. Thus reconstruction methods have been de¬ 
veloped that incorporate physical models based on second-order 


perturbation theory ([Kitaura et al. [2012 Kitaura[ [20131 [Jasche 

[& Wandelt[ [2013a 

Jasche et al.[[2015[) or full N-body calcula- 

tions ([Wang et al.[ 

2014). Reconstructions of the local Universe 


have been used in novel ways, including to estimate the bias in 
the Hubble constant due to cosmic flows ( [Hess & Kitaura[[MT^ . 

In this work we carry out a Bayesian analysis of the VIMOS 
Extragalactic Redshift Survey (VIPERS; Guzzo et al 2014). Our 
goal is to jointly estimate the key statistics including the matter 
power spectrum, galaxy biasing function and galaxy luminosity 
function. Our strategy is, given the observed number density of 
galaxies in the survey as a function of position N(RA, Dec, z), to 
compute the conditional probability distribution for the param¬ 
eters, written schematically as: the matter over-density field S, 
galaxy mean number density N, galaxy bias b and the two-point 
correlation function S . The conditional probability distribution 
or posterior may be decomposed using Bayes theorem: 


p(S, N, b, S |V) oc p(N\S, N, b,S)p(S, N, b,S). 


( 1 ) 


demonstrate in a comparison study that the use of a Poisson 
sampling model for the galaxy counts with a log-normal prior 
on the density field allows better estimation of the lowest and 
highest density extremes on small scales. Th e generalisation of 
the model calls for a fully non-linear solver ( [Jasche & Kitaura[ 
[2010[ ). The Poisson log-normal model was used to reconstruct 
the density field probed by the SDSS sample ([Jasche et al.[ 
[MTOil ). 

The Gaussian likelihood has also been used to construct 
maximum a posteriori estimators for the galaxy power spec¬ 
trum ( [Efstathiou & Mood^ [200 1[ [Tegmark et al.[ [2002[ [Pope[ 


The first and second factors on the right-hand side are the data 
likelihood and the parameter prior. We will account for obser¬ 
vational systematics such as the survey selection function in the 
data model, but we will not propagate their uncertainties. Eor 
VIPERS, the uncertainties in the selection function are subdom¬ 
inant compared with the statistical errors and so the inclusion of 
the uncertainties will be reserved for future work. We adopt the 
Gibbs sampling algorithm to sample from the posterior distri¬ 
bution ( [Marin & Robert[[2007[ ). With this approach the complex 
joint probability distribution is broken up in a number of simpler, 
individual conditional distributions. Sampling these distributions 
allows us to build up a Markov chain that rapidly converges to 
the joint distribution. 

VIPERS has mapped the galaxy field to redshift 1 with un¬ 
precedented fidelity ( [Guzzo et"ar|[2014t[Garilli et al.[[2014[ ). So 
far, VIPERS data have been used to constrain the growth rate of 
structure through the shape of the redshift-space galaxy corre¬ 
lation function ( [de la Torre et al.[ [2013[ ). The cosmological in¬ 
terpretation of the galaxy power spectrum monopole has been 
presented by Rota et al (in prep.). The biasing function that links 
the galaxy and dark matter density has been estimated using the 
luminosity-dependent correlation function ( [Marulli et al.[ [2013[ ) 
and the shape of the 1-point probability distribution function of 
galaxy counts in cells ( [Di Porto et al.[[20T4} . Moreover VIPERS 
has tightened the constraints on the galaxy luminosity and stellar 
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Fig. 2: We construct subsamples of galaxies in colour, abso¬ 
lute magnitude and redshift bins. Left: we show the absolute 
magnitude-colour plane. The histogram at top shows the distri¬ 
bution of colour. We divide the sample into blue and red classes 
following [Fritz et ^ ( |2Q14| ) at Mu - My + 0.25z = 1.1. Right: 
the sample is further binned by redshift and absolute magnitude. 
The luminosity bins account for the mean evolutionary trend. 
The faintest luminosity bins are not volume limited and the thick 
blue and dashed red curves show the limiting magnitudes for 
blue and red galaxy classes. 


mass functions ( jPavidzon et al.||2QT^ [Fritz et al.||2014| ). These 
measurements firmly anchor models of galaxy formation at red¬ 
shift 1. 

Beyond the one and two point statistics of the galaxy field, 
galaxies are organised into a cosmic web of knots, filaments and 
walls that surround large empty voids. In VIPERS the higher or¬ 
der moments of the galaxy counts in cells distribution function 
have been measured (Cappi et al, 2015). While ongoing efforts 
are being made to measure the morphologies of cosmic struc¬ 
tures. A catalogue of voids has been constructed with VIPERS 
(Micheletti et al., 2014; Hawken et al., in prep). More gener¬ 
ally, the Minkowski functionals may be used to characterise the 
topology of large-scale structure as a function of scale (Schimd, 
in prep.). These measurements typically call for precise recon¬ 
structions of the density field corrected for observational system- 
atics such as survey gaps and inhomogeneous sampling. 

This work extends previous analyses by considering the joint 
distribution of galaxy luminosity, colour and clustering bias with 
the spatial power spectrum and density field. We begin in Sec.[^ 
with an overview of VIPERS and the parameterisation of the 
selection function. The data model is described in Sec. and 
the method is outlined in|^ In Sec we present the constraints 
from VIPERS data. 

We assume the following fiducial cosmology = 0.27, 
= 0.0469, Qa = 0.73, Us = 0.95, Hq = 70 km/s/Mpc, 
(Tg = 0.80. This coincides with the MultiDark simulation run 
( [Prada et aL|[20I2| ) that was used to construct the mock VIPERS 
catalogues. Magnitudes are in the AB system unless noted. The 
absolute magnitudes used were computed under a flat cosmol¬ 
ogy with = 0.30; however we transform all magnitudes to 
the = 0.27 cosmology. 


2, VIPERS 

The VIMOS Public Extragalactic Redshift Survey (VIPERS) 
is an ESO programme on VLT (European Souther Qbserva- 
tory - Very Large Telescope; [Guzzo et al.[ [20I4t [Garilli et~ar 
20I4| ). The survey targets galaxies for medium resolution spec¬ 
troscopy using VIMOS (Visible Multi-Object Spectrograph; [L^ 


[Fevre et ak) [2003 ) within two regions of the WI and W4 fields 
of the CFHTLS-Wide Survey (Canada-France-Hawaii Telescope 
Legacy Wide jCuillandre et~ar|[2012| ). Targets are chosen based 
upon colour selection to be in the redshift range 0.5-1.2. The 
final expected sky coverage of VIPERS is 24 deg^. 

For each galaxy, the 5-band rest-frame magnitude was esti¬ 
mated following the Spectral Energy Distribution (SED) fitting 
method described in [Davidzon et al.[ ( [20I3[ ) and adopted to de¬ 
fine volume limited samples. The choice of 5-band rest-frame is 
natural, corresponding to the observed /-band at redshift -0.8. 
We derived K-corrections from the best-fitting SED templates 
using all available photometry including near-UV, optical, and 
near-infrared. 


2.1. Sample selection 


This analysis is based on the VIPERS v5 internal data release 
which represents 11% of the final survey. We select sources from 
the VIPERS catalogue in the redshift range 0.6 - 1.0 with red¬ 
shift confidence > 95% (redshift flags 2,3,4,9). The redshift dis¬ 
tribution is shown in Fig. The sources have estimated rest 
frame Buser B-band magnitudes and U - V colours defined in 
the Johnson-Cousins-Kron system as described by [Fritz et aTT 
( [20I4| ). The total number of sources used in the analysis is 
36928. We construct subsamples of galaxies in bins of redshift 
(Az = O.I), luminosity (AMg = 0.5), and colour as illustrated 
in Fig. 1^ We separate red and blue galaxy classes using the cut 
defined by [Fritz et al.[ ([20I4|) at {Mu - + 0.25z = LI 

in the Vega system. In total there are 37 bins, 19 for blue and 18 
for red galaxy subsamples, including those that are not complete, 
see the discussion in Section 3.3. 


2.2. Survey completeness 


The VIPERS survey coverage is characterised by an angular 
mask ( [Guzzo et al.[ [20I4[ ). The mask is made up of a mosaic 
of VIMOS pointings, each consisting of four quadrants. Regions 
around bright stars and of poor photometric quality in CFHTLS 
photometric catalogue have been removed. 


Within an observed quadrant there are many factors includ¬ 
ing the intrinsic source properties, instrument response and ob¬ 
serving conditions that determine the final selection function 
( [Garilli et al.j [20I4[ ). The fraction of sources out of the parent 
photometric sample that are targeted for spectroscopy is referred 
to as the target sampling rate (TSR). Among the targeted sources, 
not all will give a reliable redshift measurement. We refer to this 
fraction as the spectroscopic success rate (SSR). The sampling 
rate is the product of the TSR and SSR: r = rrsRrssR- 


The arrangement of slits in VIMOS is strongly constrained 
since the sp ectra cannot overlap on the imaging plane ( [Bottini[ 
[et al.j [2Q05[ ). In VIPERS, the result is that the number of tar¬ 
geted sources in a pointing is approximately constant, damping 
the galaxy clustering signal both on small and large scales ( [Pollo[ 
[et al.[[2ofet[de la Torre et al.|[20I3[ ). 

Since, as described in Section 3, in our data model we bin 
the galaxies onto a cubic grid, it is only necessary to estimate 
the TSR correction on the scale of the cubic cell. For 5/z“^Mpc 
cells, this corresponds to 10 arcmin at z = 0.7, larger in fact, 
than the VIMOS quadrants. The TSR is estimated on a fine grid 
as the fraction of targets out of the parent sample within a 3 
arcmin circular aperture: Ntarget!^parent- The fine grid is then 
down-sampled to determine the average TSR in each grid cell. 
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Fig. 3: The distribution of VIPERS targets on the sky is plotted for the two fields W1 and W4. The points are coloured according to 
the target sampling rate (TSR) which is defined as the ratio of the number of targeted galaxies in a patch of sky over the total number 
available in the parent photometric catalogue. In this work we estimate the TSR within a circular aperture with radius 3 arcmin. The 
TSR depends on the projected density of targets on the sky. TSR is higher in low-density fields with few potential targets while in 
high density fields slit positioning constraints severely limit the number of sources that may be targeted. The inset histogram shows 
the cumulative fraction of targets with TSR below a given value. The median TSR over the survey is 48%. 
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Fig. 4: The spectroscopic success rate (SSR) quantifies the fraction of targets for which the redshift could be measured with >95% 
confidence. At left we show the mean SSR of each pointing divided by quadrant (7x8 arcmin). The inset histogram gives the 
cumulative number of quadrants with SSR below the given value. At right, we show the SSR as a function of Iab magnitude (solid 
curves) with an analytic fit (dashed curves). The sample is divided based on the overall quality of the quadrants quantified by SSR. 
The quadrants are ranked by mean SSR and the curves are computed for each decile. The range of SSR and the number of sources 
in the bin are given in the figure legend. 


The colours in Fig. [^indicate the TSR measurements as a func¬ 
tion of angular position (at the positions of observed galaxies). 

The spectroscopic success rate (SSR) is primarily correlated 
to the conditions at the time of observation and so varies with 
pointing. For a particular source the SSR depends on the ap¬ 
parent flux as well as the spectral features that are available to 
make the redshift measurement. We find that the primary contri¬ 
bution comes from the apparent flux, and we quantify the mean 


SSR, defined as Nmeasure d IN target ca ch quadrant, as a function 
of the i band magnitude ( [Guzzo et ari|2014[|Garilli et ah |2014 ). 
The degradation is most severe in poor observing conditions, so 
we compute the SSR separately according to the quality of the 
quadrant. We rank the quadrants based on mean SSR and com¬ 
pute separately rssRi^d in each decile. The SSR is fit with an 
analytic form: rgsRijn) = <^(1 - right panel of 

Fig. 1^ we see that SSR depends strongly on the quadrant quality. 
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For the top 10% of quadrants (shown by the red curve in Fig.|^, 
the SSR remains > 90%, but it drops quickly as quadrant qual¬ 
ity falls. What is important to note here is that the shape of the 
rssRiiAs) curves changes as a function of quadrant quality. 

The effect of the weights on the redshift distribution is shown 
in Fig. 


2.3. Mock catalogues 

We use a set of simulated (mock) galaxy catalogues constructed 
to match the VIPERS observing strategy. The catalogues are 
built on the MultiDark N-body simulation ( [Prada et |2012| ) 
using the Halo Occupation Distribution (HOD) technique. De 
tails of the construction may b e found in ( |de la Torre et al.||2013 
|de la Torre & Peacock[|2Q13] ). 

Galaxies were added to the dark matter halos in the simula¬ 
tion according to a luminosity dependent HOD model. The cor¬ 
relation function and number counts in luminosity bins were set 
to match measurements made at 0.5 < z < 1.2 in CFHTLS, 
VVDS and earlier releases of VIPERS. Each mock galaxy is 
characterised by its angular coordinate, comoving distance, “ob¬ 
served” redshift including its errors and an absolute magnitude 
in the B band. 

We partition the mock catalogues into bins of redshift and 
luminosity, but not in colour, as we do for the VPERS data. The 
step size in redshift and luminosity are Az = 0.1 and AMb = 0.2 
over the redshift range 0.6 < z < 1.0. We have 46 bins including 
those bins which are not complete due to the apparent flux limit. 

The mock galaxies match the number density of the VIPERS 
observations, but do not include the slit placement constraints 
that we correct in the data with the TSR weights. Nor do the 
catalogues simulate the spectroscopic sampling rate. 


3. Data model 

3.1. Galaxy number counts 

We overlay a three-dimensional cartesian grid on the survey. The 
number of galaxies in a given sample observed within a cell in¬ 
dexed by i is related to an underlying continuous galaxy density 
field Sg by 


Ni = Nwi(l^SG,i)^eG ( 2 ) 

where wi is the spatial selection function and Nt is the mean den¬ 
sity providing the normalisation. The stochastic nature of galaxy 
counts is captured by the random variable 6/ and is dominated 
by Poisson noise except in the highest density peaks ( |Di Porto| 
|et al.[ |20I4| ). The cells are defined in comoving redshift-space 
coordinates and we adopt a fiducial cosmology to define the re¬ 
lationship between redshift and comoving distance. 

The selection function w in Eq.|^gives the likelihood of ob¬ 
serving a galaxy at a given grid point. It accounts for the angular 
geometry of the survey, sampling rate and redshift distribution. 
In this analysis we separate the angular and line-of-sight compo¬ 
nents. As described in Section 5.1, the angular dependence is de¬ 
termined from the survey mask and target sampling rate (TSR) 
while the redshift distribution is computed assuming the lumi¬ 
nosity function and apparent flux limit for the given subsample 
of galaxies. 

The expected number of galaxies in a cell is given by the 
product of N and the selection function, giving 

(Ni) = Nwi. (3) 


The selection function as described can account for spatial 
variations but cannot describe sampling dependencies on galaxy 
type or apparent flux. For VIPERS data we will up-weight galax¬ 
ies based on the inverse spectroscopic success rate (SSR) de¬ 
pending on quadrant and apparent i band magnitude. These 
weights are only indirectly correlated with the density held so 
they result in an amplification of the shot noise level. The SSR 
weight of a galaxy i^wssr and the noise amplification factor is 
= {^ssr) averaged over ah galaxies in the subsample. There¬ 
fore, in Eq.^Ni represents the weighted count of galaxies and, 
consequently, the variance of the stochastic term, cr^, is boosted 
to acrl. 


In this analysis we discretise the galaxy held onto a coarse 
spatial grid as well as onto a finite grid of Fourier modes. This 
process introduces an error in the density held arising from the 
aliasing of structures: small-scale structures with spatial fre¬ 
quencies higher than the Nyquist frequency become imprinted 
on larger scales ( [Hockney & Eastwood| |I988| ). The effect may 


3y assuming the spec- 


be corrected for in the power spectrum _ 

tral shape above the Nyquist frequency ( Jingj 2005| ). However, 


to accurately reconstruct the density held without making such 
assumptions, we may use a mass-assignment scheme or anti¬ 
aliasing Alter discussed in Appendix The smoothing effect 
of mass-assignment schemes introduces a convolution in Eq. 
which invalidates the simple count model. An alternative is to 
use the super-sampling method of jJasche et al.| ( |2009] ) that ap¬ 
proximates the ideal anti-aliasing Alter and does not damp small- 
scale power. Since we desire a compact window in both configu¬ 
ration and Fourier space, we adapt this technique with a soft cut 
as described in Appendix This approach reduces the aliased 
signal to the level of the triangle-shaped-cell scheme while pre¬ 
serving small-scale power to k ~ O.lkNyquist- 

The convolution introduced by the anti-aliasing Alter modi¬ 
fies the noise properties such that the Poisson expectation, cr^ = 
NiWi, cannot be assumed. Instead we use a re-scaled Poisson 
variance characterised by the factor v/ = cr^/N. As described 
in Sec.[^ the factor may be estimated in a Monte Carlo fashion 
given the mask and anti-aliasing Alter. 

A cell partially cut by the mask will become strongly coupled 
with its neighbour through the anti-aliasing Alter. In practice we 
neglect the additional ofl'-diagonal cell-cell contributions in the 
noise covariance matrix. However, we found that it is necessary 
to regularise the noise matrix by increasing the noise level to 
achieve stable results. We set a lower limit on the cell variance 
through a parameter v threshold such that the scaled shot noise has a 
floor set by v/ <— mdix{vthresholds v/). The procedure is identically 
applied both to mocks and real data. 


3.2. Galaxy bias 

We assume a constant linear biasing model such that 

Sgj = bDizdSi, (4) 


where Zi is the redshift of the cell indexed by i. The bias factor b 
depends on the luminosity and colour of the galaxy subsample. 
We give explicitly the growth of matter fluctuations with time 
according to the linear growth factor D{z) = D(z)/D{zref) with 
^(z) = S(zref)D(z). For the VIPERS sample the reference redshift 
is set to Zref = 0.7. Since VIPERS covers an extended redshift 
range this factor brings large-scale density modes to a common 
epoch. 
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Fig. 5: VIPERS cone diagrams for the fields W1 (top) and W4 (bottom). The left panels show the redshift-space positions of 
observed galaxies. The marker colour indicates the blue or red colour class and the marker size scales with B-band luminosity. The 
depth of the slice is 10/z“^Mpc. The orange line traces the field boundaries cut in the redshift direction at 0.6 < z < 1.0. At right we 
show a slice of the density field taken from one step in the Markov chain. It represents the anisotropic Wiener reconstruction from 
the weighted combination of galaxy tracers. The field is filled with a constrained Gaussian realisation. The field has been smoothed 
with a Gaussian kernel with full-width-half-max 10/z“^Mpc. The colour scale gives the over-density value. 


3.3. Number density 

The number density of galaxies in a luminosity bin is given by 
the integral of the luminosity function: 

pMfaintiz) 

n(z)= n(M,z)dM. (5) 

'-'Mbright 

We parameterise the luminosity function using the Schechter 
function ( |Schechter| |1976| ) in terms of magnitudes and the pa¬ 
rameters (0*, M*, a): 

n{M) = 0.4 In 10 (6) 


The characteristic magnitude evolves as M^ {z) = M^(0) -r &z 
with 6 -1 for red galaxies in VIPERS (Fritz et al. 2014) 


confirming the findings of previous studies at moderate redshift 


( [Ilbert et SH|2005|[Zucca et al.||2009| ). 

The number density observed is further reduced by the sur¬ 
vey completeness. In VIPERS, galaxies are targeted to an appar¬ 
ent magnitude limit of mum = 22.5 in the Iab photometric band. 
This sets an absolute magnitude limit for a given class of galaxy 


~ rrilim Dmijf) ‘7C(z), 


(7) 


where D^iz) is the distance modulus which depends only on the 
background cosmology and is the K-correction term which 
depends on the particular type of galaxy targeted. The absolute 
magnitudes were computed for each galaxy by fitting spectral 
energy distribution templates to broadband photometry as de¬ 
scribed by pavidzon et al.| ( |2013| ) and from the absolute mag¬ 
nitudes we infer the K-corrections. We parameterise the trend 
of K-correction with redshift as 7C(z) = TCq + A^iz). TCq is es¬ 
timated from the median value of galaxies within a given sub- 
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sample while A/^(z) is a polynomial fit with the following coeffi¬ 
cients, different for red and blue galaxy types: 

AK,biue(z) = 1.784(2 - 0.7 f + 0.440(2 - 0.7) - 0.678 (8) 

AK,red(z) = 2.144(2 - 0.7 f + 1.745(2 - 0.7) - 0.720. (9) 

Using this parameterisation the inferred magnitude limits 
corresponding to 50% completeness are indicated in Fig. for 
the blue and red samples by the solid and dashed lines. For 
mock samples, the K-correction term and its evolution are fixed 
to ^mock ~ 1.3. 

With these ingredients we model the redshift distribution of 
each galaxy subsample by integrating the luminosity function 
with Eqs.|7]and[^ We let the mean density of each subsample 
free to set the normalisation of the redshift distribution. We then 
take the shape given by the Schechter function to interpolate the 
luminosity function across the bin. The parameters M* and a in 
each colo ur and redshift bin are fixed to the values measured in 
VIPERS ( [Fritz et ar||2014| ). Since the precise luminosity evolu¬ 
tion is not known, the evolution term, 6, is allowed to vary as a 
function of colour and redshift. This gives a characteristic mag¬ 
nitude M^(z) = M^(zref) + - Zref) where Zref is taken to be 

the midpoint of the redshift bin. Changing & modifies the shape 
of the redshift distribution. 


3.4. Power spectrum 

The matter power spectrum in real space P(k) = is as¬ 

sumed to be isotropic. Seen in redshift-space, it is distorted along 
the line-of-sight direction ( |Hamilton||1998| ). We model the sig¬ 
nal on the cartesian Fourier grid as 


S(k,ji;/3,cry,crobs) = A 


(l +/3tZf 

1 + 


'-SHk„ky,kz)Pik), 
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Parameter 

Symbol 

Dimension 

Overdensity field 

S 

2 X 72 X 16 X 172 
(5/z“^Mpc cubic cells) 

Power spectrum 

p 

109 

Distortion factor 

p 

1 

Velocity dispersion 

O-v 

1 

Galaxy bias 

b 

37 (19 blue, 18 red) 

Mean number density 

N 

37 

Luminosity evolution 

6 

8 


Table 1: Accounting of the free parameters in the data model. 


( 10 ) 

where /i = kiosik and k = ^k^ + + k^. The line-of-sight di¬ 

rection is aligned with the grid such that kios = K taking the 
plane-parallel approximation. 

The coherent motions of galaxies on large scales are de¬ 
scribed by the |Kaiser| (|1987|) factor with p 
5M is jO 


fjbg where the 

growth rate in ACDM is j\z) = dlogDIdloga. On small 
scales, velocities randomise and may be modelled by an ex¬ 
ponential pairwise velocity dispersion giving a Lorentzian pro¬ 
file in Fourier space which we refer to as the dispersion model 
( Ballinger et al.[ |1996| ). The velocity dispersion term, in 


Eq. 


10 


has units h ^Mpc. The conversion to velocity units is 
H{z)li\ + z)/ V2 60.0 hMpc~^ km s“^ which over the red- 
shift range of interest is nearly constant. We add a Gaussian term 
along the line of sight to characterise redshift measurement er¬ 
rors where CTobs = o-czlH(z) and cTcz is the redshift error. For 
VIPERS the estima ted redshift error is cTcz = 141(1-1- z)km/s 


(Guzzo et al. 2014) and CTobs = 1.67/z ^Mpc and is nearly con 


slant over the redshift range 0.6-1.0. 

The factor !B(kx, ky,kz) accounts for the cell window function 
arising from the anti-aliasing filter and is given by Eq. |A.2[ In 
this analysis the absolute amplitude of the power spectrum is not 
constrained. So we set the amplitude A in Eq.f^to fix erg = 0.8, 
the variance computed on a scale of R = 8 /t“^pc integrated to 
the Nyquist frequency. 

We ignore geometric distortions arising from the choice of 
the fiducial cosmology ( [Alcock & PaczynsE||1979| ). The result¬ 
ing bias is not significant when compared with the statistical 
uncert ainties of the VIPERS re dshift-space clustering measure¬ 
ments ( de la Torre et al.[|2013| ). However, when carrying out a 
model test, we may rescale the density field and two point statis¬ 
tics to transform from the fiducial to the test cosmology as car¬ 
ried out for the VIPERS power spectrum analysis by Rota et al 
(in preparation), but this is not done here. 


4. Generate ^p(p,crv\P^'^^,N\b\N) 

5. Generate W+i ^/7(A|Z?^+\A) 

6. Generate 6^+1 A) 

7. Generate <—A‘^^^ A) 

These steps are repeated forming a Markov chain and after an 
initial bum-in period we can expect that the samples are repre¬ 
sentative of the joint posterior distribution. 

In the first step, we sample from the conditional probabil¬ 
ity distribution for the density field in a two-stage procedure. 
Eirst, the Wiener filter is used to compute the maximum a-priori 
field SwF ( |Kitaura et al.[ |2QI0| ). The Wiener filter solution is a 
smoothed field that gives an underestimate of the true power. To 
generate a realisation of the density field a random component 
that is uncorrelated with the observations Srandom is added (|Jew-| 

et al. 1 12004] ). The final field is thus the sum S = Swf drandom- 

After constructing a realisation of the density field, the sec¬ 
ond step is to sample the power spectmm. We do put a Gaus¬ 
sian prior on the first bin at k < 0.01/zMpc“^ setting the mean 
and variance to the Eiducial value and sample variance expec¬ 
tation. This aids the stability of the chain. A uniform prior is 
used for the bins at ^ > 0.0I/tMpc“^ We use two appro aches 
to sampling the power spectrum detailed in Appendix |B.2[ Eirst, 
we draw samples from the inverse-gamma distribution, see e.g. 
|Jasche et'SI] ( |20I0b| ). However this produces very small steps in 
the low signal-to-noise regime and so can be inefficient at small 
scales. Therefore, on alternative steps we carry out a Metropolis- 
Hastings routine to draw sam ples of the power spectrum accord¬ 
ing to the likelihood, Eq. |B.2| We find consistent sampling of the 
power spectrum using the two methods. 

Since we cannot constrain the absolute normalisation of the 
power spectrum, we normalise to the desired value of erg. We 
next draw the redshift-space distortion parameters p, cry which 
are independent of the power spectrum amplitude. 

Next, we sample from the mean density conditional prob¬ 
ability distribution for each galaxy sample which includes the 
evolution factor 8. Here we use a Poisson distribution, as de¬ 
scribed in Appendix |B.4| 

Einally we sample from the bias conditional probability dis¬ 
tribution for each galaxy sample. Thi s dis tribution is Gaussian 
for the bias parameter (see Appendix |B.3| ). In this method, the 
bias is computed on the redshift-space grid, which in our case 
has a resolution of 5/t“^Mpc. Eor physical interpretation it is in¬ 
teresting to estimate the bias averaged on larger scales. So, in 
estimating the bias we first down-grade the grid resolution by 
a factor of two, such that the bias is averaged over a scale of 
I0/z“^Mpc. We impose a uniform prior for the bias values of 
0.5 <b< 4. 


4. Gibbs sampler 

We present a brief overview of the Gibbs sampler. Since our im¬ 
plementation differs from that of |Jasche & Wandelt] ( [20I3b| ) we 
provide a detailed description in Appendix]^ The full parame¬ 
ter set introduced in the previous section is summarised in Table 
We use the Gibbs sampling method to sample from the joint 
posterior of the parameter set. This is performed by iteratively 
drawing samples from each conditional probability distribution 
in the following steps (where <—indicates that a sample is drawn 
from the given distribution): 


1. Generate ^p(S\N\ b\_P\p\ A) 

2. Generate <^p(P\S^'^^,N\b\p\crl,N) 

3. Normalise power spectrum 


5. Application to VIPERS 

5.1. Set-up 

The data and mock catalogues are processed similarly, although 
the construction of galaxy subsamples differs. The mock cata¬ 
logues do not include the inhomogeneous incompleteness cor¬ 
rected for in the data by the SSR and TSR factors. The uncer¬ 
tainties introduced by these corrections are negligible compared 
with statistical uncertainties in VIPERS. 

I. The two survey fields, WI and W4 are separately embedded 
into rectangular boxes. The grids have dimensions 72 x 16 x 
172 cubic cells with comoving size 5/z“^Mpc. We align the 
grid such that at the field centre, the three axes correspond to 
the right ascension, declination and line-of-sight directions. 
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Fig. 6: Constraints on the real-space power spectrum. The lower panel shows the relative difference with the fiducial model. The 
black dots give our estimates of the binned real-space power spectrum taken from the median of the markov chain. Overplotted is 
the fiducial model adopted in this study (black dashed curve). We find agreement with the best-fit model using VIPERS data by 
Rota et al (in preparation) (purple dot-dashed curve). The pink dashed curve is the mean of the power spectrum estimates taken from 
the 27 mock catalogues. We present three error estimates: the internal chain variance determined from VIPERS data (grey steps), 
the chain variance determined from mock catalogues (blue steps) and the variance of the individual estimates from the 27 mock 
catalogues (red steps). The error corridors show 70% confidence intervals. 


The co-moving coordinates are computed using the fiducial 
cosmological model. In the real catalogue only, galaxies are 
up-weighted by the inverse spectroscopic success rate (SSR) 
depending on quadrant and apparent /-band magnitude. 

We compute the density on the grid using the anti-aliasing fil¬ 
ter based on the super-sampling method proposed by |Jasche| 
|et al.| ( |2009| ) with a soft ^-space cutt-off as described in Ap- 
pendixjAp"^ 

The angular (a, S) and radial (z) components of the selection 
function are computed separately on the grid: w(ai,Si,Zi) = 
w(ai,Si)w(zi). 


4. For the angular component, we generate a uniform grid of 
test points that over-sample the grid by a factor of 8 and reject 
points outside the survey angular mask. For VIPERS data 
the remaining are down-sampled by the TSR. The points are 
then assigned to the grid points using the anti-aliasing mass- 
assignment scheme. The selection function w/ is then given 
by the normalised density of test points on the grid. 

5. The radial component of the selection function is estimated 
in bins of redshift, luminosity and colour. We estimate the 
median ^-correction term for galaxies within each bin and 
use Eq.j^to compute the unnormalised N(z). 

6. We estimate the generalised shot noise variance v/ = cr^/Ni 
which depends on the mask through the anti-aliasing filter in 
Monte Carlo fashion. We generate a set of 1000 shot noise 
maps by distributing random points over the survey volume. 
For VIPERS data the points are down-sampled by TSR. We 
then compute the variance for each cell of the map over the 
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1000 realisations. To regularise the noise covariance matrix, 
a threshold is set v/ = msix(ythresh, v/) where vthresh = 0.3 for 
mocks and 0.15 for data to account for TSR. 

With the galaxy number density map, selection function and 
noise map, we have all the components of the data model re¬ 
quired to estimate the posterior probability distribution in Eq. 

To sample this probability distribution we run the Gibbs sampler 
Markov chain for 2000 steps and, allowing for a burn-in period, 
begin the analysis from step 1000. The convergence properties 
and justification for the burn-in period are shown in Appendix 
Eor the VIPERS data we ran 7 independent chains for 2000 
step each providing 7000 post-burn-in samples for analysis. 

Taking the variance of the Markov chain gives us an internal 
error estimate on the parameters. The runs on mock catalogues 
show that the chain variance corresponds to the expected sample 
variance for the power spectrum. However, this is not necessarily 
true for the other statistics. Eor instance the luminosity function 
quantifies the distribution of observed galaxies which remains 
fixed in the chain. It is only indirectly dependent on the underly¬ 
ing density. 

5.2. Density field 

The density field taken from a single step (1500) in the Markov 
chain is shown in Eig. It represents the application of the 
Wiener filter on a bias-weighted combination of the galaxy sub¬ 
samples. The reconstruction is based on the redshift-space dis- 
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P{k=0.A) 



P{k=QA) beff a, (km/s) /3 fas 

Fig. 7: We show the degeneracies between RSD parameters 
I3,cry, effective bias and the power spectrum at ^ = 0.4/zMpc“^. 
The shaded regions mark the 68% and 95% confidence inter¬ 
vals from VIPERS chain and the star symbols mark the mean 
value. The points give the distribution of mean values derived 
from mock catalogues. The histograms along the diagonal give 
the marginalised distributions of each parameter chain. The filled 
histogram gives the distribution from the VIPERS chain, while 
the solid line is the distribution of mean values derived from the 
mock catalogues. 


tortion model and so the resulting field is anisotropic and charac¬ 
terised by effective redshift-space distortion parameters averaged 
over the galaxy samples. 

The result of the Wiener filter is an adaptively smoothed 
field that extrapolates structures over the correlation length of a 
few megaparsecs. To build a full realisation of the structures we 
add a Gaussian constrained realisation that fills in the gaps and 
gives the full variance. The structures outside the survey bound¬ 
ary are generated from a random Gaussian realisation although 
the phases are properly aligned at the boundary. The true galaxy 
density field on scales of 5/zMpc“^ is far from Gaussian and the 
difference is visible by eye. 

In Fig. we can recognise the cosmic web of structures 
including knots, filaments and void regions. The structures are 
richest where the sampling is highest at lower redshift. At red- 
shift z > 0.8 we see fewer coherent structures and the contri¬ 
bution from the constrained Gaussian realisation is larger. Each 
step of the Markov chain gives a reconstruction of the field with 
different realisations of noise and large-scale modes. Once the 
chain has passed the bum-in period (see Appendix [^, we can 
consider these realisations to represent Gaussian perturbations 
around the observed galaxy field. 

5.3. Redshift-space power spectrum 

The galaxy power spectrum in redshift space is parameterised in 
terms of the real-space matter power spectrum, bias and redshift- 


space distortion factors (Eq.[^. We bin the power spectrum lin¬ 
early with bin size Ak = O.OI giving 109 bins. The redshift-space 
distortion parameters are fit to k < 0.4/zMpc“^ This limit corre¬ 
sponds to kcTy ^ I where we can expect the dispersion model to 
break down. 

The Markov chain provides joint samples of the parame¬ 
ters. In Fig. we show the median over the power spectrum 
chain (black dots). The confidence coordior gives the I-cr con¬ 
fidence interval estimated from the chain variance (grey steps). 
We find good agreement with the model co mputed with CLASS 
and Halofit (black dashed) with = 0.27 ( |Lesgourgues[ 201It 
Smith et al.[|2003[[Takahashi et al.[|20I2| ). The best-fitting model 
determined by Rota et al (in preparation) has = 0.272 + .03 
(over-plotted with purple dot-dashed curve). 

On small scales k > 0.45/zMpc“^ (0.75x Nyquist frequency) 
the power drops. This is due to neglecting correlations between 
cells that arise due to the anti-aliasing filter. On large scales, 
there is a dip in power aX k = 0.05/zMpc“^seen in both mock 
catalogues and data, although it biases the estimate only at the 
Icr level. Scales at k < 0.06/zMpc“^ are only measured in the 
line-of-sight direction with VIPERS so the inability to recon¬ 
struct them properly without a prior constraint is not surprising. 

The median values for the redshift-space distortion parame¬ 
ters we find are Pvipers = 0-41 and fimock = 0.47. The within- 
chain variance is crchain = (-0.12, -hO.IO) (68% confidence inter¬ 
val), while the scatter of the 26 mocks gives standard deviation 
rk" mock ~ 0.09. 

We may compute the growth rate through the relation 


— ^Cr^^galaxyiz), (H) 

where cr^^gaiaxy = be/fcr^. In this analysis we have fixed the am¬ 
plitude of the matter power spectrum at redshift z = 0.7 with 
cr^(z = 0.7) = 0.643 which corresponds to erg = 0.8 at z = 0 in 
the fiducial cosmology. 

We compute the effective galaxy bias as the number- 
weighted average over the galaxy samples. 


Kff = 


'LiiNiWiib] 

Z/,; NiWii 


( 12 ) 


where the sums are over the galaxy subsamples and selection 
function grids. 

We summarise the constraints on the growth rate in Table 
We find {foshiPERS = 0-38!o;p7 and (fosimock = 0-46!oat 
^ = 0.7 where we quote the chain variance. The scatter between 
the mocks gives a standard deviation CTf^mock = 0.08. Thus these 
constraints on the growth rate are in agreement with the VIPERS 
correlation function measurement by |de la Torre et al.| ( |20I3| ). 
There, the error was 16% on the growth rate /erg = 0.48 at z = 
0.8. In this work we find an error of 18%. We attribute the higher 
error in this analysis to the fact that we marginalise over the real- 
space power spectrum, while in the previous analysis it was fixed 
to a fiducial cosmology. 

The correlations between a subset of the power spectrum and 
redshift-space distortion parameters are shown in Fig.|7] The star 
symbols mark the median values of the parameters estimated 
from the VIPERS Markov chain while the filled contours give 
the 70% and 90% confidence intervals. The median value and 
marginalised 70% uncertainty on each parameter are labeled. We 
find that /erg (18% relative error) is better constrained than (3 
(20% error). This is due to the anti-correlation between beff and 
P with correlation coefficient p = -0.58. These correlations arise 
from the specific parameterisation adopted and would be modi¬ 
fied under a different data model. 
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The black dots in Fig. |7] represent the median values esti¬ 
mated from individual mock catalogues. We find that the value 
of the galaxy bias is different within the mocks (beff = 1.55) 
and VIPERS (be/f = 1.44). The bias of the mock galaxies is de- 
termined by the luminosity-dependent HOD prescription ( |de la 
[Torre et JT] |2013| ) and so the minor difference from real data is 
not unexpected. Accounting for the difference in bias, we find 
excellent agreement between the distribution of mocks and the 
parameters estimated from real data. The similarity of the PDF 
shapes also gives us confidence in the analysis method and error 
estimates. Furthermore, since the mocks do not include many of 
the selection effects present in the data the agreement suggests 
that these sources of systematic uncertainties are not influencing 
our conclusions. 


5.4. Colour and luminosity dependent galaxy bias 

We compute the galaxy bias from the variance of the galaxy 
counts on the grid. However, we first down-sample the grid by 
a factor of two such that the bias is computed on a scale of 
lor'Mpc. 

In Fig. we show the median bias values of the Markov 
chain and the confidence intervals are given by the chain vari¬ 
ance. The bias is computed in bins of redshift, luminosity 
and colour. We find a colour bimodality with red galaxies 
more strongly biased than blue. This corresponds to the well 
known galaxy morphology-density relationship that early type 
galaxies are predominantly found in high density environments 
(e.g. jCucciati et al.j |2006t [Dresslerj 1980} [Davis & Geller 


\\916) . Similarly, we expect to find that galaxy bias increases 
with galaxy luminosity since more massive and more lumi¬ 
nous galaxies tend to form in more massive dark matter clumps 
( [Coupon et al.[[MT^ . 

Previous studies with VIPERS data estimated the galaxy bias 
of the full galaxy sample as a function of luminosity and red- 
shift. jM^nTet^ ([20^ measured the projected galaxy corre¬ 
lation function in luminosity and redshift bins t o constrain the 
mean bias averaged over scales 5 - 20/z“^Mpc. Di Porto et al 


( |2014[ ) modeled the counts-in-cells probability distribution func¬ 
tion to estimate the linear bias. To compare our estimate of the 
galaxy bias with these previous results we construct luminos¬ 
ity threshold samples counting both red and blue galaxies. We 
cross-correlate these number density maps with the Wiener den¬ 
sity field from the core analysis and estimate the measurement 
uncertainty from the chain variance. The resulting bias values 
are shown by the markers with error bars in Fig. We find ex¬ 
cellent agreement with the previous analyses, although our red¬ 
shift bins differ. The bias values from [Di Porto et ak] ( [20141) have 


Marulli 


been taken on a scale R = 8/z“^Mpc while those of __ 

[et al.[ ( [20T3] ) are sensitive to smaller scales. The disagreement at 
z > 0.9 may indicate that the bias of luminous galaxies is scale 
dependent at high redshifts probed differently by the three stud¬ 
ies. 


5.5. Luminosity function 

In Fig. [T^ we show the derived luminosity function based on 
the mean galaxy number density (dots with error bars) for dif¬ 
ferent galaxy types. We compare the result to the analysis from 
Fritz et al.]([2014[ ) based on the STY ( [Ilbert et aT][2005t|Sandage[ 


et al.[[1979[ ) estimator (dashed curves). We can expect to find 


a difference in the two estimates arising from how the galaxy 
bias is treated. The STY estimate is designed to be independent 
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Fig. 8: The VIPERS galaxy bias parameters in redshift, luminos¬ 
ity and colour bins. A colour bimodality is seen in each redshift 
bin. The trend with luminosity is most striking in the lower red¬ 
shift bins for both blue and red galaxies. 


of the underlying density field under the approximation that the 
galaxy luminosity is uncorrelated with density. A strong lumi¬ 
nosity dependence of the bias can systematically tilt the inferred 
luminosity function ( [Smith [2012[ Cole[ [20lT] ). The agreement 
between our analysis and the STY measurement indicates that 
the VIPERS volume is large enough such that sample variance 
does not significantly alter the amplitude and that any effects due 
to the luminosity dependence of bias are weak. 


5.6. Parameter covariance 

We estimate the covariance of the statistics with the Markov 


chain. Fig. 11 shows the normalised correlation matrix deter¬ 
mined in our analysis. 


C 

O-lO-r 


(13) 


The bias and mean number density parameters are ordered first 
by luminosity and colour and then by redshift bin. The appear¬ 
ance of blocks in the matrix indicates that within redshift bins the 
statistics are strongly correlated. We also find that bias and mean 
density are anti-correlated, that is, increasing bias necessitates 
decreasing mean density to preserve the same fluctuation. The 
bias and mean density parameters are weakly correlated with the 
power spectrum measurement. 

The bins of the power spectrum (spacing Ak = 0.01/zMpc“^) 
show independence on large scales, as is expected for the Gaus¬ 
sian data model, but they become correlated at ^ > 0.3/zMpc“^ 
The correlations arise from the redshift-space parametrisation 
that couples the amplitude of the power spectrum to p and cTy. 
The upper right square in the figure represents the nearly 100% 
correlation between these two parameters. 
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p 

^P,chain 

^fi,mock 

i>eff 

^ b,chain 

^ b,mock 

/0’8 

^f,chain 

^f,mock 

Mock 

0.47 

-0.12/-L0.09 

0.09 

1.54 

-0.04/-L0.04 

0.03 

0.46 

-f0.09/-0.12 

0.08 

VIPERS 

0.41 

-0.08/-h0.07 


1.44 

-0.03/-f0.02 


0.38 

-0.07/-f0.06 



Table 2: The constraints on redshift-space distortion parameters. We give the 68% confidence intervals from the chains and the 
standard deviation amongst the 26 mock catalogues. The fiducial value is fcr^iz = 0.7) = 0.45. 
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Fig. 9: The galaxy bias measured from the full (red and blue combined) VIPERS galaxy sample in luminosity threshold bins. Ref¬ 
erence data are taken from the VIPERS projected correlation function analysis ( [Marulli et al.[|20I3] ) and counts-in-cells probability 
distribution function analysis dDi Porto et al.l 120141). Note that the redshift ranges differ. 



Mb -Slog/? Mb -Slog/? 


Fig. 10: The galaxy luminosity function inferred from the mean 
density Markov chain for red, blue and combined samples in red¬ 
shift bins. Markers are plotted at the median value of the chain 
and the height of the rectangles indicates the 68% confidence 
interval. The Schechter function fits from |Fritz et al.| ( |20I4| ) are 
overplotted for comparison. 

6. Conclusions 

Using VIPERS we have demonstrated a method to reconstruct 
the galaxy density field jointly with the redshift-space power 
spectrum, galaxy biasing function and galaxy luminosity func¬ 
tion with minimal priors on these parameters. The Bayesian 
framework naturally accounts for the correlations between these 



Fig. II: The normalised correlation matrix of the parameters 
computed from the VIPERS Markov chain. The blocks repre¬ 
sent the mean density, galaxy bias, power spectrum and RSD 
parameters. The structure in the covariance arises from the data 
model parameterisation. The values of luminosity and colour de¬ 
pendencies of galaxy bias and mean density within a redshift bin 
are strongly correlated, while they are only weakly correlated 
across redshift. On large scales the power spectrum covariance 
is diagonal, but at k > 0.3/zMpc“khe bins become correlated 
due to coupling of the small-scale power with the redshift-space 
distortion parameters. 


observables. We adopt a likelihood function for the galaxy num¬ 
ber counts that is given by a multivariate Gaussian and set a 
Gaussian prior on the density field. The solution that maximises 
the posterior distribution is given by the classical Wiener filter. 
To sample from the posterior distribution we add a Gaussian 
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constrained realisation. Incorporating this density field estima¬ 
tor within a Gibbs sampler, we jointly sample the full posterior 
distribution including the power spectrum, bias and luminosity 
function parameters. We find encouraging results using a multi¬ 
variate Gaussian model for the likelihood and prior distributions, 
although more theoretically motivated descriptions may be used 
( [Kitaura et al.[|2012l|Jasche & Wandelt]|2013a| ). 

There are clear gains when jointly estimating correlated pa¬ 
rameters. For instance the galaxy colour-density relation can 
be used to improve estimates of the density field. Furthermore 
it is well known that bias weighting galaxies when estimating 
the power spectrum leads to improved accura cy ([Percival et al. 


2004| ) and greater statistical power ( |Cai et al.j|20i 1| ). 


Moreover, the Bayesian framework provides a recipe for 
propagating uncertainties in the measurement, incorporating 
prior knowledge and constraints from the data and guarantees 
reliable error estimates. In VIPERS we account for inhomo¬ 
geneous sampling and detailed angular masks. We correct for 
the selection function of VIPERS by up weighting galaxies ac¬ 
cording to the magnitude-dependent spectroscopic sampling rate 
(SSR), while including the target sampling rate (TSR) in the an¬ 
gular dependence of the survey selection function. These correc¬ 
tions are fixed in our analysis, although for upcoming surveys it 
will be important to propagate the uncertainties in the selection 
function to the data products. 

Investigating the covariances between parameters, we find 
strong correlations between galaxy bias and number density pa¬ 
rameters within a given redshift bin. This is not unexpected since 
both these parameters depend on the one-point probability dis¬ 
tribution function of the density field. On the other hand the cor¬ 
relation with the power spectrum is weak. 

Our estimate of the power spectrum is effectively decon¬ 
volved from the survey window function (see Rota et al. in prep.) 
and we find that the covariance of the power spectrum bins is di¬ 
agonal on large scales as expected from an unmasked Gaussian 
random field. On small scales, k > 0.3/zMpc“^we find signifi¬ 
cant correlations between power spectrum bins. On these scales 
correlations are expected due to the physical processes of struc¬ 
ture formation; however, in this case the correlations arise from 
the parameterisation of the data model. There is a degeneracy 
between the redshift-space distortion factors p and cr^ and the 
amplitude of the power spectrum on small scales. Nevertheless, 
the error estimate given by the Gibbs sampler matches well the 
expectation of cosmic variance estimated from mock catalogues. 

Our results are in good agreement with previous VIPERS 
measurements. We find values of the redshift-space distortion 
factor p that are consistent with the correlation function analy¬ 
sis (|de la Torre et al. 


2013). The values of luminosity depen- 


dent bias we find follow the trends expected from |Marulli et al.| 
( |2013| ) and |Di Porto et al.| ( [2014| ) at z < 0.9. We further estimate 
the galaxy bias for colour samples finding a more pronounced 
dependence on luminosity for red galaxies than blue. The lumi¬ 
nosity function we infer from the mean number density matches 
well with those found by |Eritz et al.| ( [2014| ) using the STY esti¬ 
mator. 

In our analysis we have left the power spectrum, galaxy bias 
and number density without parameterisation. Despite this free¬ 
dom, we find that the resulting error in the key quantities such 
as the distortion parameter are only marginally larger than that 
given by traditional methods which can be strongly dependent 
on parameterisation. 

Our methodology may be extended to jointly analyse multi¬ 
ple datasets in a self-consistent manner. A particular challenge 
when considering multiple surveys is dealing with the differ- 
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ences in angular coverage, sampling rates and galaxy types. The 
Bayesian approach provides a means to homogenise datasets al¬ 
lowing for consistent measurements. 

Eor future studies with VIPERS we can consider the joint 
analysis wi th the VVDS-Wide spectroscopic survey ( [Garilli 


|et al.[ 2008| ). Although the two surveys partially overlap, the se¬ 
lection function and sampling rates differ prohibiting their direct 
combination. However, through the Bayesian framework, the 
joint analysis becomes natural. We may further add constraints 
given by the density reconstructions in the gaps by the ZADE 
algorithm ( [Cucciati et al.[|2014| ) or photometric redshift samples 
from the ful l CEHTLS Wide fields ( [Granett et al.||2012||Coupon 


|et aH|201^ . Sheer measurements in these fields can provide ad¬ 
ditional constraints on the underl ying matter density p roviding 
a powerful probe in combination ( Coupon et al.||2Q15] ). Eor up¬ 
coming surveys, this strategy will guarantee a complete and self- 
consistent picture of the Universe. 
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Fig. A.l: Top panel: an illustration of the mass-assignment ker¬ 
nels in two-dimensions: a hard k-space cut (Sup hard), a soft 
k-space cut (Sup soft), nearest-grid-point (NGP) and cloud-in¬ 
cell (CIC). The lower three panels give a comparison of the ker¬ 
nel functions (in one dimension) for various mass assignment 
schemes. Top: the kernels in Fourier space. The Nyquist fre¬ 
quency is 1 pixel“^ on the scale. Middle: the kernels in position 
space. Bottom: the cumulative power of each kernel in position 
space. 


Appendix A: Anti-aliasing filter 


Binning the continuous galaxy density field onto a grid may 
be characterised as a smoothing operation followed by discrete 
sampling ( [Hockney & Eastwood| |1988| ). In the simplest ap¬ 
proach, if particles are assigned to the nearest grid point (NGP) 
the smoothing kernel W(x) has a top-hat shape with the dimen¬ 
sion of the grid cell. More extended kernels can be chosen that 
distribute the weight of the particle over multiple cells. Common 
assignment schemes include cloud-in-cell (CIC) and triangle- 
shaped-cell (TSC) which correspond to iterative smoothing op- 
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Wavenumber 


Fig. A.2: A comparison of the shot noise power as a function 
of wavenumber under various mass assignment schemes. The 
Nyquist frequency is 1 pixel"^ on the scale. The top four plots 
show the principal component (thick line), the ^ = -1 harmonic 
and the sum of all harmonics (thin line). The bottom frame shows 
the fraction of aliased power for each assignment scheme. 


erations with the same top-hat filter. Smoothing the field damps 
small scale power, but better localises the signal in k-space 
which is beneficial in Fourier analyses. 

A consequence of using the fast Fourier transform (FFT) al¬ 
gorithm is that the signal must be discretised onto a finite number 
of wave modes or equivalently that periodic boundary conditions 
are imposed. The observed power is thus a sum of all harmonics 


of the fundamental wavelength which are known as aliases (ling 
2005} [Hockney & Eastwood||l988| ): 


P'{k) = Yj+ 2kNn)\^Pik + IkNti) 


(A.1) 


As long as the signal is sufficiently well sampled and band- 
limited, meaning that there is no power above the Nyquist fre¬ 
quency, the signal may be recovered without error. However, 
most signals of interest are not band limited, and so after sam¬ 
pling onto the grid, the harmonics overlap and the signal cannot 
be exactly recovered. 

The ideal anti-aliasing filter in k-space is the top-hat that cuts 
all power above the Nyquist frequency. However, the signal can¬ 
not be localised both in k-space and in position-space and a sharp 
cut in k-space will distribute the mass of a particle over the entire 
grid. 

A practical implementation of the ideal filter was developed 
by [Jasche et al.[ ( [2009] ). The method involves first super-sampling 
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the field by assigning particles to a grid with resolution higher 
than the target grid. Transforming to Fourier space via FFT, the 
high resolution grid is filtered setting to 0 all modes above the 
target Nyquist frequency. Finally the grid is transformed back to 
position-space and down sampled to the target grid. The tech¬ 
nique is very efficient with the additional trick that the down- 
sampling step can be carried out by the inverse FFT by cutting 
and reshaping the Fourier grid. 

The algorithm of |Jasche et al.| ( [2009] ) is limited only by the 
memory needed to apply the FFT on the high resolution grid, 
so it is competitive with common position-space cell assignment 
schemes. However, the sharp cut in k-space leads to an extended, 
oscillatory distribution in position-space. Such a decentralised 
and unphysical cell assignment is undesirable for estimation of 
the density field. _ 

An alternative approach was taken by |Cui et~ar (20081 who 
introduce the use of Daubechies wavelets to approximate the 
ideal filter while keeping compactness in position-space. We do 
not consider this technique here because the centre of mass of 
the wavelets adopted is offset from the particle position and so 
they are not well suited for estimation of the density field. 

As a compromise, we test a smooth cutoff with a k-space 
filter with the form: 


w(k,,ky,k,)= ]q 


1 


i={x,y,z) 


1 -f \kilkj^\^ 


(A.2) 


The parameter a adjusts the sharpness of the cut: we test a = 100 
(Sup ha rd) an d a = ^ (Sup soft). 

Fig. |A.1| shows the comparison of the mass assignment 
schemes including the super-sampling technique. We see that 
the hard cut (Sup hard) leads to ringing in position-space (the 
transform is the sine function). The amplitude is significantly re¬ 
duced when the parameter a = ^ (Sup soft) and the first peak is 
more compact than CIC. In k-space, the filter remains nearly 1 to 
~ O.lk^ while NGP, CIC and TSC functions are dropping. Be¬ 
yond the Nyquist frequency, the soft cut-off kernel drops faster 
than TSC without oscillatory behaviour. The bottom panel of |A.l| 

shows the cumulative power S(R) = \W(r)^dr. We see that 

soft-cut off scheme has compactness similar to CIC and NGP, 
while t he latt er schemes severely damp the total power. 

Fig. |A.2| compares the aliased power in the power spectrum 
measurement. The top four panels show the shot noise power 
multiplied by the kernels. The principal contribution (n = 0) and 
the first harmonic (^ = -1) are plotted along with the full sum of 
all harmonics. We see for instance, that the power measured with 
the NGP scheme mixes power from all harmonics. The hard cut¬ 
off filter performs nearly ideally with no aliasing effect. The soft 
cut-off also performs well matching the aliasing characteristics 
of the TSC scheme. 

To implement the soft cut-off filter we use the following 
practical super-sampling recipe: 

1. Assign particles to a grid with resolution increased by a fac¬ 
tor /. The assignment is done with the CIC scheme. We set 
/= 8 . 

2. Transform the field to Fourier space with the FFT. 

3. Multiply Fourier modes by the k-space filter. 

4. Transform back to position space with the inverse-FFT. 

5. Take a sample of the grid every / cells to down-sample to 
the target resolution. 


Appendix B: Gibbs sampier 

Here we give the algorithms used to sample from the conditional 
probability functions. 


Appendix B.1: Sannpling the density field 

Using a galaxy survey we count galaxies in a given sample I and 
construct the spatial field N/ which is a vector of length riceiis- We 
will write the set of m galaxy samples (e.g. different luminosity 
and redshift bins) as {N/} = ..., Each sample I has 

a corresponding mean density A/ and bias bi. 

We write the conditional probability for the underlying den¬ 
sity field 6 as 

p(smi {A/}, {bil S) cx ;7({N/}|^, m, {bil S)p(S\S) 

^samples 

= n PmS,Ni,bi,S)p{d\S) (B.l) 

l=l 

To write the last line we use the property that the number counts 
of different samples / are conditionally independent but depend 
on the common underlying density field. 

For clarity we now explicitly index the cells with subscript i. 
We adopt a Gaussian model for the number counts and write the 
log of the likelihood log p siS 

-2logp(Ni\Ni,bi,S) = 

- Niwij (I + biD(zi)St)f 

Yj - -^^ + log(2;rcr^,). (B.2) 

i=i ^i,i 

The variance of counts is generalised as cr^. and may be differ¬ 
ent from the Poisson expectation Aw due to the mask and anti¬ 
aliasing filter. However, we neglect noise correlations between 
cells which would introduce off-diagonal terms. The sums are 
carried out over cells with variance cr^. >0. 

i,i 

Next we consider the density field prior. We set a Gaussian 
model giving 


blcells blcells 

-21og/7(5|S) = y V + log(2;rdetS) (B.3) 

>1 i=i 


The correlation function of d is given by the anisotropic covari¬ 
ance matrix S. In Fourier space the matrix is diagonal and given 
by Eq. [T^ When writing the posterior we now drop the terms 
that do not depend on 6 giving 


-21og/7(^|{N/},{A/},{Z7,},S) = 
2 


- Nmj (1 + biDiSd) ^ 


l=\ i=l 


+ (B.4) 

Al 


Differentiating the log posterior with respect to 6 we find the 
equation for the maximum a posteriori estimator which is also 
called the Wiener filter. The estimate 6 is given by 


{Niw^biD,f , ^ 

l,i j=l 

NiWlibiDi f . 

l=l ^ l,i 


l=l 


(B.5) 


It is informative to point out how the Wiener filter operation 
combines the galaxy subsamples. The right hand side of Eq. [B3] 
shows the weighted combination, given by 


A/W/ ibiDi / - \ 

2 


(B.6) 
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where we consider cell i and the sum is over galaxy samples in¬ 
dexed by /. The subsamples are being weighted by their relative 
biases /?/. This form of weighting for the density field was de¬ 
rived by |Cai et al.| ( |2011| ) and is optimal in the case of Poisson 
sampling but also matches the weights for the power spectrum 
( [Percival et al.[|2004| ). 

To generate a resi dual field Sr with the correct covariance we 
follow the method of [Jewell et al.| ( [2004| ). We draw two sets of 
Gaussian distributed random variables with zero mean and unit 
variance: wij and W 2 ,/. The residual field is then found by solving 
the following linear equations. 




“2“ N,w,jb,Di 


l=l 


^l,i 




(B.7) 


^cells 



j=i 


(B.8) 


^samples U r\^ Tlcelh 




■ + li 


(B.9) 


i=i 


7=1 


The final constrained realisation is given by the sum Scr = S-\-Sr. 

The terms including the signal covariance matrix are com¬ 
puted in Fourier space where the matrix is diagonal. The trans¬ 
formation is done using the Fast fourier transform algorithm. 

and Eq. |B.9| using the linear conjugate gra¬ 
dient solver bi eg stab provided in the SciPy python library 
(http://www.scipy.org/). 


We solve Eq. B.5 


Appendix B.2: Sampling the signal 

The posterior distribution for the signal is 

p(S\S)ocp(S\S)p(S) (B.IO) 

We take a fiat prior leaving p(S |d) oc p(S\S) which is given by 
Eq. |B.3| This is recognised as an inverse-Gamma distribution 
for S which can be sampled directly ( jKitaura & EnBlinj |2008t 
jJasche et "31112010131 ). 

We parametrise the signal in Fourier space as the product 
of the real-space power spectrum and redshift-space distortion 
model (Eq. [T^. We sample the parameters in two steps. First 
we fix p and cr^ and draw a real-space power spectrum from the 
inverse-gamma distribution. 

The normalisation of the power spectrum is fixed by 

oi = ^ (f’kP{k)\W{k)\^ (B.ll) 

In the regime where shot noise is more important than cos¬ 
mic variance, this method of sampling is inefficient. An alter¬ 
native sampling scheme was proposed to improve the conver¬ 
gence ( [Jewell etari|2009[|Jasche et ar||2010b| ). The new power 
spectrum is drawn making a step in both P and S such that 
!P^S\ The consequence is that the conditional 
posterior p{P\S) is unchanged but the data likelihood is modified 
through S. We perform a sequence of Metropolis-Hastings steps 
to jointly draw P and S. We alternate between the two modes for 
sampling P on each step of the Gibbs sampler. 

The redshift-space distortion parameters p and are next 
sampled jointly. These parameters are strongly degenerate on the 


Article number, page 16 of 


17 


scales we consider. To efficiently sample these we compute the 
eigen decomposition of the Hessian matrix to rotate into a coor¬ 
dinate system with two orthogonal parameters. We evaluate the 
joint probability over a two-dimensional grid in the orthogonal 
space. Finally we use a rejection sampling algorithm to jointly 
draw values of p and cr^. 


Appendix B.3: Sampling galaxy bias 


The bias is sampled in the manner described by Jasche & Wan- 


[delt[ ( [2013b| ). The conditional probability distribution for the bias 
of a given galaxy sample is a Gaussian with mean given by: 


Y^NiWiDMNi-Ndlcr] 

=-;-^2- 

Xi{NmDid,) la] 

and variance 


la] 


(B.12) 


(B.13) 


After drawing a value from a Gaussian distribution with the 
given mean and variance, it is limited to the range 0.5 < /? < 4.0. 


Appendix BA: Sampling mean density 


We take a Poisson model for the mean density. Taking a uniform 
prior we have 


p(mN,b,s) 


oc 


oc 


p(N\S,b,N,S)p(N) 


n 


[ a ^ w , (1 + bD6)'\ 
AM 


-Nwi(\+bDiSi) 


(B.14) 

(B.15) 


Keeping only terms that depend on N we have 

log p(N) = Yj V log N-Nwi(l+ bDSi) (B. 16) 

i 

We draw a sample from this distribution by computing the like¬ 
lihood over a grid of values and using a rejection sampling algo¬ 
rithm. 


Appendix C: Convergence analysis 

The Gelman-Rubin convergence diagnostic ( [Gelman & Rubin 


[1992t [Brooks & Gelmanj [1998[ ) involves running multiple inde 


pendent Markov chains and comparing the single-chain variance 
to the between-chain variance. The scale reduction factor R is the 
ratio of the two variance estimates. In Fig. [C.l[ we show this con¬ 
vergence diagnostic for three parameters: the power spectrum at 
k = 0.2/zMpc“\ the distortion parameter p and the velocity dis¬ 
persion (Ty. These parameters are the slowest to converge. The 
R factor is computed up to a given maximum step i in the chain 
after discarding the first //2 steps. We consider the chain to be 
properly ‘burned-in’ when R < 1.2. Based on this diagnostic we 
set the maximum step in our analysis to be 2000 and discard the 
first 1000 steps 
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Fig. C.l: The Gelman-Rubin convergence diagnostic R com¬ 
puted from a single mock catalogue with 10 chains. 
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